Research Article
Design Novel Peptide Derived from Protein-Protein Interfaces of 3CL-Pro SARS/CoV-2 and ACE-II Enzyme Complexes for Inhibition of COVID-19 Virus Replications by Computational Approach
K Rajaganapathy*, Ramalingam Sathiyasundar, D Suresh Lingam, Chidambaram Ganapathy and Shaik Mohammad Sameer
Corresponding Author: Rajaganapathy Kaliyaperumal, Department of Genomics, Bogar Bio Bee Stores Pvt Ltd, No-26, 14th Cross Main Road, New Thillai Nagar, Vadavalli, Coimbatore-641041, Tamil Nadu, India
Received: May 05, 2020; Revised: June 11, 2020; Accepted: June 09, 2020
Citation: K Rajaganapathy, Sathiyasundar R, D Suresh Lingam, Ganapathy C & Sameer SM. (2020) Design Novel Peptide Derived from Protein-Protein Interfaces of 3CL-Pro SARS/CoV-2 and ACE-II Enzyme Complexes for Inhibition of COVID-19 Virus Replications by Computational Approach. Proteomics Bioinformatics, 3(1): 135-153.
Copyrights: ©2021 K Rajaganapathy, Sathiyasundar R, D Suresh Lingam, Ganapathy C & Sameer SM. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Share :
  • 878

    Views & Citations
  • 10

    Likes & Shares

The novel coronavirus disease 2019 (COVID-19 or SARS/CoV-2) is rapidly expanding infection around the globe. This infection causes many deaths on a daily basis all over the world. Therapeutic options are currently limited. There is an emergent need of finding potential therapeutic agents for COVID-19. As a key enzyme in the life-cycle of corona-virus, the 3C-like main protease (3CL-Pro or M--Pro) is the most attractive for antiviral drug design. Therefore, current work are focused on RNA interference silences gene expression in COVID-19 through short interfering of SARS/CoV-2-RNA segments that guide mRNA degradation in a sequence-specific fashion at blocking the 3CL-Pro-SARS/CoV-2 active site further to block the proteins accession on infected human host cells leads to inhibit the virus replications. Thus, we developed the Structure-based drug design, based on a recently solved structure 3CL-Pro-SARS/CoV-2 (PDB ID: 6LU7 its 99% identical of COVID-19 reported and 6VW1 is a SARS/CoV-2 3CL-Pro complexed with ACE-II), for the intention of generating novel peptides has been derived from  protein-protein interfaces of SARS-CoV-2-3CL-Pro and ACE-II (it’s an entry receptor for COVID-19) as, Yu-Chuan Chang et.al., 2020 complexes, based on the structure-based optimization, we obtained the peptide-3  as shown a near perfect dock in the overlap region of the protein pocket, the active sites inferred from three selected target proteins of 3CL-Pro/SARS-CoV-2/ACE-II are compatible with the docking score. Hence, these peptide-3 can be used as potential candidate’s leads for the inhibition of protein-protein interactions of SARS/CoV-2 and ACE-II with against the COVID-19 through binding and activation.

Keywords: SARS/CoV-2, 3CL-Proteases, COVID-19, Peptides, Protein-protein interfaces, Docking

INTRODUCTION

The novel coronavirus disease 2019 (COVID-19 or SARS/CoV-2) is a rapidly expanding infection around the globe. The COVID-19 compared to previous outbreaks of severe acute respiratory syndrome (SARS-CoV) and Middle East respiratory syndrome (MERS) is more contagious. The World Health Organization (WHO) in March 2020 announced the Coronavirus pandemic. This infection causes many deaths on a daily basis all over the world. Therapeutic options are currently limited. There is an emergent need of finding potential therapeutic agents for COVID-19[1,2].

The SARS/CoV-2 Coronaviruses are single-stranded positive-sense RNA viruses that possess large viral RNAgenomes. Recent studies showed that SARS-CoV-2 has a similar genomic organization to other beta-coronaviruses,

consisting of a 5’-untranslated region (UTR), a replicase complex (orf1ab) encoding non-structural proteins (nsps), a spike protein (S) gene, envelope protein (E) gene, a membrane protein (M) gene, a nucleocapsid protein (N) gene, 3’-UTR, and several unidentified non-structural open reading frames. Although SARS-CoV-2 is classified into the beta-coronaviruses group, it is diverse from MERS-CoV and SARS-CoV. Recent studies highlighted that SARS-CoV-2 genes share <80% nucleotide identity and 89.10% nucleotide similarity with SARS-CoV genes. Usually, beta-coronaviruses produce a ~800 kDa polypeptide upon transcription of the genome. This polypeptide is proteolytically cleaved to generate various proteins. The proteolytic processing is mediated by papain-like protease (Plpro) and 3-chymotrypsin-like protease (3Clpro or 3CLPro also known as MPro). The 3Clpro cleaves the polyprotein at 11 distinct sites to generate various non-structural proteins that are important for viral replication. 3Clpro play a critical role in the replication of virus particles and unlike structural/accessory protein-encoding genes, it is located at the 3’ end which exhibits excessive variability. Therefore, it is a potential target for anti-coronaviruses espeasiallyCOVID-19 inhibitors and vaccine screening. In addition, the alveolar epithelial cells have abundant expression of angiotensin-converting enzyme 2 (ACE2), which is targeted by the virus. The recognition of ACE2 by the S protein of the virus enables the invasion of the coronavirus into the human circulation system [1,2]. Recent study demonstrates that ACE2 is the SARS-CoV-2 receptor, which is required for cell entry [3]. Single-strand RNA (ssRNA) viruses such as the coronavirus (SARS/CoV-2) family replicate the virus genomes by taking advantage of host cells. For example, after coronavirus approaches the ribosome of the epithelial cells or other host cells, it uses the ribosome of the host cell to replicate polyproteins. The replication and subsequent processes of precursor polyproteins can occur in the epithelial cells [3]. After the coronavirus’ polyproteins are expressed, two enzymes specifically, coronavirus main proteinase (3Clpro) and the papain-like protease (Plpro) are thought to be involved in cleaving the polyproteins into smaller products used for replicating new viruses [3]. In order to generate the daughter RNA genome, the coronavirus expresses an RNA-dependent RNA polymerase (RdRp), which is a crucial replicase that catalyzes the synthesis of a complementary RNA strand using the virus RNA templates [1,2].

Therefore, current work are focused on RNA interference silences gene expression in COVID-19 through short interfering of SARS/CoV-2-RNA segments that guide mRNA degradation in a sequence-specific fashion at blocking the 3CL-Pro-SARS/CoV-2 active site further to block the proteins accession on infected human host cells leads to inhibit the virus replications. Thus, we developed the Structure-based drug design, based on a recently solved structure 3CL-Pro-SARS/CoV-2 (PDB ID: 6LU7 its 99% identical of COVID-19 reported as., Yu-Chuan Chang et.al. [1] and 6VW1 is a SARS/CoV-2 3CL-Pro complexed with ACE-II), for the intention of generating novel peptides has been derived from  protein-protein interfaces of SARS-CoV-2-3CL-Pro and ACE-II (it’s a entry receptor for COVID-19) complexes, based on the structure-based optimization, we obtained as peptide-3  shows a near perfect dock in the overlap region of the protein pocket, the active sites inferred from three selected target proteins of 3CL-Pro/SARS-CoV-2/ACE-II are compatible with the docking score.  Hence, these peptide-3 can be used as potential candidate’s leads for the inhibition of protein-protein interactions of SARS/CoV-2 and ACE-II with against the COVID-19 through binding and activation.

Although, Tin-Yun Ho et al. [2], Antiviral Research 69 (2006) 70–76 reported SARS-Cov inhibitors of small peptides derived from S protein on the binding of S protein to ACE2 and on the S-protein-pseudotyped retrovirus infectivity. SP-4 (residues 192–203), SP-8 (residues 483–494), and SP-10 (residues 668–679) significantly blocked the interaction between S protein and ACE2 by biotinylated enzyme-linked immunosorbent assay, with IC 50 values of 4.30 ± 2.18, 6.99 ± 0.71, and 1.88 ± 0.52 nmol, respectively [2,3]. Based on that, we developed peptides derived from the protein-protein interfaces regions of 3-CL protease-SARS-CoV-2 and ACE-II complexes with the aim of blocking the 3CL-Pro cleavage in SARS/CoV-2 interacting cellular receptors especially inhibit the ACE-2 receptor entry.

MATERIALS &METHODS

The Proteins/Macromolecules of COVID-19, the SARS/CoV-2 3clpro/MproPDB ID: 6LU7[Ref] structures were obtained from PDB (https://www.rcsb.org/), in the file format of dot.pdb format. The PDB is an archive for the crystal structures of biological macromolecules from worldwide [4,5]. The 6LU7 protein which contains two chains, A and C, which form a homodimer. Chain A was used for macromolecule preparation. The native ligand for 6LU7is n-[(5-methylisoxazol-3-yl)carbonyl]alanyl-l-valyl-n~1~-((1r,2z)-4-(benzyloxy)-4-oxo-1-{[(3r)-2-oxopyrrolidin-3-yl]methyl}but-2-enyl)-l-leucinamide and the PDB ID: 6VW1 is a crystal structure of 3CLPro-SARS/CoV-2 complexes with ACE-II [6-7] were also used for this studies.

Protein-protein docking

In order to mimic the 3CLPro-SARS/CoV-2 – ACE-II complex, the 3D crystal structures of proteins: 3CLPro-SARS/CoV-2 (PDB ID: 6LU7) and ACE-II (PDB ID: 6VW1_A, B) were retrieved from the Protein Data Bank. These, Protein-Protein complex prediction was mimicked by using Patch Dock. First, the 3CLPro-SARS/CoV-2 was docked with the ACE-II complex, finally to obtain a mimicked structure.

 

Protein-protein interfaces and peptide designing

The obtained 3D complex of 3CLPro-SARS/CoV-2-ACE-II was introduced to the online meta-PPISP server for the prediction of protein-protein interface region (Table 1). The interface region was interpreted and analyzed by SwissPDB-Viewer. The possible three peptide structures that mimic the interface region structures were obtained and drawn by using ChemDraw. ChemDraw has advanced chemical editor for drawing chemical structures and it has a rich list of editing features, further the obtained peptides were saved as dot.MOLformar (Tables 1-3).

In silico-high throughput structure-based screening

The three peptides of protein-protein interfaces were designed based on drug-like properties of their ability is due to the ligand to interact and inhibit the native ligand catalytic site of 3-CL Pro SARS/CoV-2 and ACE-II protein. A lead compounds or scaffolds can be identified from diversified compound pool and accelerated screening, a screened pool is focused for bio-targets to inhibit the diseases. Hence, we used structural based screening, through molecular docking [8,9] by using binding and activation with the GOLD drug discovery software.

GOLD (Genetic Optimization for Ligand Docking) is a genetic algorithm for docking flexible ligands into protein binding sites. GOLD offers a choice of scoring functions of GoldScore/ChemScore. The GOLD fitness function is made up of four components: 1st protein-ligand hydrogen bond energy (external H-bond). 2nd the protein-ligand van der Waals (vdw) energy (external vdw). Thired, ligand internal vdw energy (internal vdw) and 4th the ligand torsional strain energy (internal torsion). The Input Parameters of Protein: Allows specification of the protein input file and ligand: Thus, Added file or Update selected file button or Add all files in directory button to add the chosen ligand file or directory to Set atom types: Controls whether atom types are set manually or automatically for (a) the ligand(s) and (b) the protein for Defining the Binding Site we should specify the approximate centre and extent of the binding site. This can be done in several ways: such as from a point, from a protein atom, from a file containing a list of atoms, from a protein residue, from a file containing a list of residues and from a reference ligand and we can use cavity detection to confine the calculation to regions enclosed within concave parts of the binding site surface cavity volume, as determined by the cavity detection algorithm, can also be output and define active site from: Allows specification of the position of the binding site with respect to a point, a protein atom close to the centre of the site, a set of protein atoms lining the site, or a reference ligand and active site radius: Allows specification of the radius of the binding site and Covalent: Allows specification of a protein-ligand covalent bond.

The Fitness and Search Options used to control: the Ligand flexibility during docking, including whether ligand ring conformations are varied, whether torsion angles around ligand amide bonds and bonds to trigonal nitrogen are allowed to vary during docking, whether intramolecular hydrogen bonds are permitted between ligand atoms, and whether protonated carboxylic acids are permitted to rotate or flip and the parameters testing is based on Edit Constraints: Allows specification of distance constraints, hydrogen bond constraints, regional (hydrophobic) constraints, and binding mode similarity constraints and Van der Waals: Only available if Gold Score selected. Allows specification of the van der Waals annealing parameter and Hydrogen Bonding: Only available if Gold Score selected. Allows specification of the hydrogen-bonding annealing parameter and the Genetic Algorithm Parameters were selected GA Presets and Automatic Settings: Allows specification of speed and accuracy of docking runs to the Population Size: Allows specification of the population size and Selection Pressure: Allows specification of the selection pressure and number of Operations: Allows specification of the total number of operations to be performed in a genetic algorithm run and further number of Islands: Allows the genetic algorithm to be split over n islands, where n is user specified and Niche Size: Allows specification of the niche size to be used and Migrate/Mutate/Crossover: Controls the relative frequencies with which the three types of genetic operations occur. Thus, the complete strategy followed in this study was reversed to ensure that the identified potential leads really fit the generated models and active sites of both targets. All the parameters required for molecular docking were set as used in actual process.

RESULTS & DISCUSSION

The three-protein complex of 3CL-Pro-SARS/CoV-2-ACE-II obtained by Patch Dock. The complex is 1394 amino-acids long and the interfaces were predicted as by the online server meta-PPISP. The positive interface regions were analyzed using Swiss PDB-Viewer tools. The Protein-protein interface mimics were used for predicting peptides similar to the interface region. There were three positive interface regions predicted. Table 1 shows the results of the positive Interface regions from the protein complex. The Interface Region-1 contained the amino-acids sequence of >6LU7; FDVVRQCSGVTF while The Interface Region-2 contained the amino acids of >6LU7; ELLQNGMNGRTILGSA and the interface region-3 containing the sequences of >6LU7; ELLQNGMNGRTILGSA. All the interface regions were present in the A-chain of SARS/CoV-2.  Based on the interface regions-1,2 and 3 mimicked as Peptide-1, 2 and 3 respectively (Tables 2 and 3). Those structures were considered as novel peptides for the inhibition of COVID-19 virus replication by Insilco high through put structure-based screening.

Column 1: AA (Amino Acid code)

Column 2: Ch (Chain ID)

Column 3: AA# (Amino Acid number)

Column 4-7: Prediction scores of cons-PPISP, PINUP, Promate, and meta-PPISP

Note: PINUP and Promate scores are scaled by 100

Column 8: Prediction of whether the residue is in an interface

(P = Positive; N = Negative; - = Burried and not predicted)

(Note: P corresponds to a score > 0.34 in 7th column; user may set a different threshold for positive prediction.)

According to study of RNA interference silences gene expression in COVID-19 through short interfering of SARS/CoV-2-RNA segments that guide mRNA degradation in a sequence-specific fashion at blocking the 3CLPro-SARS/CoV-2 active site further to block the proteins accession on infected human host cells leads to inhibit the virus replication. Hence, we developed the Structure-based drug design, based on a recently solved structure COVID-19 (PDB ID: 6LU7 its 97% identical of COVID-19 reported by Yu-Chuan Chang etal. [1] and 6VW1 is a SARS/CoV-2 3CLPro complexed with ACE-II ), for the intervention of generating novel peptides, has been derived from protein-protein interfaces of SARS-CoV-2-3CLpro and ACE-II (it’s an entry receptor for COVID-19) complexes, with the aim of inhibition of the virus reproduction by targeting the mRNAs for either blocking of the 3CLpro cleavage in SARS/CoV-2 and ACE-II interacting cellular receptors. Based on the structure-based optimization of 3-CL Pro SARS/CoV-2-protein RNAs signaling pathway in viral network, there are various sites present in 3-CL Pro SARS/CoV-2-protein.

Thus, in this study, we chose three target one is 3-chymotrypsin-like protease (3CL-protease), the main protease used to cleave polyproteins into replication-related proteins, and RdRp, the main protein for RNA replication, as the target receptors. The 3CL-protease structure of COVID-19 (PDB ID: 6LU7) was obtained from the RCSB Protein Data Bank, which was recently released on February 5th, 2020. Hence, we can give us a strong structural insight about its mechanism of action and also enable us to trace important target residues for designing drug inhibitors against the 3-CL Pro SARS/CoV-2-protein signaling pathway. The minimized free energy was calculated with electrostatics and van der Waals interactions between residues of complex, the active site and binding site amino acid can be labeled (Figure 1).

The second target was focused on SARS/CoV-2 PDB ID: 6VW1_A, these active site amino acid residues can be labeled (Figure 2). The final target was focused on angiotensine converting enzyme-II, these protein active site amino acid resedues can be also labeled (Figure 3) which is an entry receptor for COVID-19, based on the structure based optimization, the newly designed three peptides (Tables 2 and 3) separately docked with above selected three targets with the aim of blocking these three target mechanism for against COVID-19.

A molecular docking study by GOLD drug discovery software was performed to calculate of binding energy and H-bond interaction, H-bond donor and acceptors, hydrophobicity and lipophobicity at all three focused peptides with targets. In order to gain functional and structural insight into the mechanism of most active lead compounds, molecular docking simulation was performed by the aid of GOLD software. Docking simulation is a popular approach for the preliminary screening in structure-based drug design. By performing docking simulation, information on feasible conformations of the ligand within the protein binding site can be obtained. This information can also reflect the nature and quality of the interaction. In our study, we examined three target separately docked with the ligands (peptides) the grid box for docking simulation was built with enough size to enable probing into the binding with first 3CL Proteases and second SARS/CoV-2 and thired ACE-II is the first human protein whose three dimensional X-ray crystal structure is broadly used in molecular docking studies to expose the binding properties of COVID-19 inhibitors.

The peptide-3, which displayed superior COVID-19 inhibitory activity in all selected target mechanism such as 3CL Proteases/SARS/CoV-2/ACE-II enzymes, which possessed highest docking activity (Figure 4-6 and Table 4-6) among the rest of selected and designed compounds. This simulation may assist to reveal binding orientation and interaction of these molecules with amino acid residues composing active site gorge in these 3CL Proteases/SARS/CoV-2/ACE-II enzymes.

The ligand-protein interactions indicated in pep-3 with docked Confirmation of COVID-19 main protease (3CL-Pro) PDB ID: 6LU7; explaining the Ligand-Protein interactions and score having the Fitness is -136.14,      S(hb_ext) is 4.52,        S(vdw_ext) is 42.34,      S(hb_int) is 0.00 and  S( vdw_int) is -198.89 and number of hydrogen bond presented in 10, indicated that hydrogen bonding, hydrophobic and mild polar interactions are the three major interactions incorporating the attachment of this ligand to 3CL Proteases enzymes.

 

In second, Pep-3 with docked confirmation of SARS-COV-2 Chimeric Receptor-Binding Domain Chain-E PDB_6VW1_E with the potential lead Pep-3; Explaining the orientation of the blocking the SARS/CoV-2 interactions in the entry receptors through binding and activation of certain parameters such as shown Fitness is -238.78,  S(hb_ext) is 26.43,  S(vdw_ext) is 34.36,  S(hb_int) is 0.00 and   S(vdw_int) is -312.45 and number of hydrogen bond presented in 8.

In finally the pep-3 Docked confirmation of human ACE2 angiotensin-converting enzyme-Chain-A PDB: 6VW1_A: Explaining the blocking of SARS/CoV-2 interactions in entry into the ACE-II receptor through binding and activation, having the scores of Fitness is –110.02, S(hb_ext) is 10.47, S(vdw_ext) is 32.48, S(hb_int) is 0.00 and S(int) is -165.14 and number of hydrogen bond presented in 10. The preliminary results suggested that the best docking scores and the comparison of the docking sites of these Pep-3 ligands shows a near perfect dock in the overlap region of the protein pocket, the active sites inferred from three selected target proteins of 3CL-Pro/SARS-CoV-2/ACE-II are compatible with the docking score. After analyzing them on the basis of their interactions and docking values were compared to the Tin-Yun Ho, et.al, [2]. Antiviral Research 69 (2006) 70–76 reported peptides derived from spike protein of SARS/Cov and retrovirus, has been efficiently blocked the binding of S protein to ACE2, in contrast, our targeted PEP-3 has been derived from the COVID-19 of 3CL-Pro-SARS/CoV-2 and ACE-II cellular receptors especially inhibit the ACE-2 receptor entry through binding and activation.

Hence, the Pep-3 was identified as potential lead candidate drugs for the inhibitory activity of 3CL Proteases/SARS-CoV-2/ACE-II for against COVID-19, which may give to the concern in its safety and efficacy for future development of the pre-clinical pharmacological therapeutics for COVID-19.

CONCLUSION

Demonstrated the utility of in-silico structure-based drug design, we were competent to achieve the novel PEP-3 has been derived from  protein-protein interfaces of SARS-CoV-2-3CLpro and ACE-II (it’s a entry receptor for COVID-19) complexes, with indicated the inhibition of the  virus reproduction by targeting the mRNAs for either blocking of the 3CLpro cleavage in SARS/CoV-2 and ACE-II  interacting cellular receptors. Hence the pep-3shown potential anti-viral-COVID-19 activity through binding and activation.

 

  1. Chang Y, Tung Y, Lee K, Chen T, Hsiao Y, et al. (2020) Potential therapeutic agents for COVID-19 based on the analysis of protease and RNA polymerase docking. Preprints 2020, 2020020242. Available online at: https://www.preprints.org/manuscript/202002.0242/v1
  2. Ho TY, Wu SL, Chen JC, Wei YC, Cheng SE, et al. (2006) Design and biological activities of novel inhibitory peptides for SARS-CoV spike protein and angiotensin-converting enzyme 2 interaction. Antivir Res69:70-76.
  3. Zhao H, Zhou J, Zhang K, Chu H, Liu D, et al. (2016) A novel peptide with potent and broad-spectrum antiviral activities against multiple respiratory viruses. Sci Rep 6: 22008.
  4. Tang B,He F, Liu D, Fang M, Wu Z, et al. (2020) AI-aided design of novel targeted covalent inhibitors against SARS-CoV-2.BioRxiv.
  5. .Khaerunnisa S, Kurniawan H, Awaluddin R, Suhartati S, Soetjipto S (2020) Potential Inhibitor of COVID-19 Main Protease (Mpro) from several medicinal plant compounds by molecular docking study. Preprints, 2020030226 Available online at: https://www.preprints.org/manuscript/202003.0226/v1
  6. Guangdi L, Erik C (2020) Therapeutic options for the 2019 novel coronavirus (2019-nCoV). Nat Rev Drug Discov19: 149-150.
  7. Robson B (2020) Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus. Comput Biol Med 119: 103670.
  8. Kaliyaperumal R, Mudugal MP, Nagaraja P, Taj N (2018) Exploration of novel sulpho tyrosine based unnatural amino acid chemical space for allosteric site of human shp2 protein tyrosine phosphatase: A Computational Approach. J Drug Des Med Chem4: 22-34.
  9. Rajaganapathy K, Manavalan R, Kannan K, KalaichelvanVK (2014) Histone Deacetylase Inhibitor Based Novel Lead Molecules and its Analogues: A Molecular Docking Ap-proachInventi Impact: Molecular Modeling2014.